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Abstract 

There are claims that there is correlation between 
the speed of center of mass of the solar system and 
the global temperature anomaly. This is partly 
grounded in data analysis and partly in a priori 
expectations. The magnitude squared coherence 
function is the proper measure for testing such 
claims. It is not hard to produce high coherence 
estimates at periods around 15-22 and 50-60 years 
between these data sets. This is done in two inde¬ 
pendent ways, by wavelets and by a periodogram 
method. But does a coherence of high value mean 
that there is coherence of high significance? In or¬ 
der to investigate that, four different measures for 
significance are studied. Due to the periodic nature 
of the data, only Monte Carlo simulation based on 
a non-parametric random phase method is appro¬ 
priate. 

None of the high values of coherence then turn 
out to be significant. Coupled with a lack of a phys¬ 
ical mechanism that can connect these phenomena, 
the planetary hypothesis is therefore dismissed. 


1 Introduction 

The center of the mass of the solar system moves 
up to about one sola diameter due to the influence 
of in particular the gas giants Jupiter and Saturn. 
A 2-D view of the orbit is shown in Fig. [T] Can 
the signature of this orbit be found in the global 
temperature? 

In order to argue this case one needs two kinds 
of evidence. The first and most important one is 
an explanation in terms of a physical model that 
explains why this orbit should influence processes 


in the sun which in turn will influence the tem¬ 
perature on earth. There is no such explanation 
which is generally accepted today, see e.g. |Calle-| 
baut et al. (2012), despite the existence of alterna¬ 


tive theories (Scafetta 2012) and recent discussions 


(Abreu et al. 

2012 Cameron and Schiissler 

2013 

Cauquoin et a 

1. 2014 

Poluianov and Usoskin, 

2014 

Abreu et al. 

>0141. 


The second way is to compare data and look 
for correlation and coherence. This is the ap¬ 
proach followed by Scafetta (2010). He compared 
the HadCRUT3 global temperature anomaly with 
the speed of the center of mass of the solar system 
(SCMSS). The raw data is plotted in Fig. [t] and 
runs from 1850 to May 2014. The most obvious 
feature of the temperature data (upper curve) is a 
gradual rise which is captured imperfectly by the 
linear and parabolic fits. The secondary feature is 
a pattern of oscillations which is what this paper 
is concerned with. Scafetta is to be commended 
for having drawn the attention to these oscillations 
and for raising various hypotheses for their origin. 
The most obvious feature of the SCMSS data in 
the lower part of Fig. [2] is a periodicity of about 
20 years which is the synodic period of Jupiter and 
Saturn, i.e. the period for which their positions and 
that of the Sun are realigned. 


The main argument of Scafetta (2010) was based 
on a comparison of power spectra which allegedly 
demonstrated coherence, and in particular at pe¬ 
riods around 20 and 60 years. Then in Holm| 


(20 14a|b| ) I argued that this is only a qualitative 
comparison. There has for a long time been avail¬ 
able a much better tool and this is the magni¬ 
tude squared coherence (MSC) function. It is a 
frequency dependent normalized correlation value 
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Figure 1: Orbit of the center of mass of the solar 
system from 1980 to 2014 generated by the Hori¬ 
zons system of JPL 



Figure 2: Global 


CRUT3, Brohan et al. 
mean 


temperature anomaly 
upper curve, 


(2006), 


Had- 

with 


linear and quadratic fits and speed of cen¬ 
ter of mass of the solar system (SCMSS) generated 
by the Horizons system of JPL, lower curve, with 
mean value 


between 0 and 1 which is also sensitive to phase re¬ 
lationships. The MSC was therefore introduced to 
this problem and it was shown that in order to get 
a statistically reliable estimate of coherence from 
the 160 or so years of data available, one could not 
average over windows which were much longer than 
40 years. This was due to the particular significance 
interval estimate that was used in that paper. It 
was therefore not possible reliably to resolve coher¬ 
ence at a period around 60 years. 

As a response to this, it was argued in |Scafetta| 
(2014) along three lines. First the similarity of 


spectral estimates of climate and planetary se¬ 
ries was again used as an argument as Fig. 6B 


Third he argued that with this high resolution MSC 
estimator, one could increase the window length 
to 110 years. With windows of this length, a line 
around 60 years should be resolvable, and such a 
line came out of his analysis as well. 

I don’t see this as a discussion of right or wrong 
methodology, at least not as long as one is talk¬ 
ing about estimating the MSC. It is rather about 
significance and questions such as if a high coher¬ 
ence value automatically implies that it has high 
significance, and how important the reliability of 
the coherence estimator’s amplitude is. Since there 


from 

Scafetta 

2010 

) was reproduced as Fig. 12 of dence in the amplitude estimate, given the limited 

Scafetta 

(2014 

i. He also reproduced the results time-span, then indirectly this is also a question of 


from an undocumented coherency test. Contrary 
to him, I actually see the opposite of coherence in 
this figure as the spectral lines do not overlap very 
convincingly at frequencies around the 20 and 60 
year periods. 

In the rest of Scafetta’s paper the MSC of 


resolution. My main aim in this paper is to show 


that the primary difference between Scafetta (2014) 


and Holm (2014a) is the weight placed on these 


Holm (2014a) has now been adopted rather than 
a comparison of spectral estimates. His sec¬ 
ond argument was that one needs to improve the 
MSC estimator based on averaged windowed peri- 


properties. 

A secondary difference is the emphasis placed 
on a priori expectations. They are fine for fram¬ 
ing a hypothesis, but not necessarily for defend¬ 
ing it when data goes against the hypothesis. But 


this is what Scafetta (2014) does when he invokes 


odograms and cross-spectra of Holm (2014a) and 


replace it with the minimum variance distortion¬ 


less (MYDR) MSC estimator (Benesty et al., 2006). 


Kepler’s first and second laws as well as Kepler’s 
Mysterium Cosmographicum (1596, 1621) and Har- 
monices Mundi (1619) to argue against my analy¬ 
sis since it didn’t find 60 year periodicities of the 
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strength that he anticipated. 
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Further, it is not at all obvious that Kepler’s 
contributions, beyond his three laws, are so use¬ 
ful. Here is what a critical historian of science has 
written: '’The three major gems in his works on 
astronomy lay in a vast field of errors, of irrele¬ 
vant data and details, of mystical fantasies, of use¬ 
less speculations, of morbid detours of self analysis, 
and, last but not least, of an organismic and ani¬ 
mistic conception of the world and its processes” 


(Jaki 1978). Anyway, to me this is a simple ques¬ 


tion of analysis of the data and not of any a priori 
expectations. The SCMSS data used in |Scafetta| 
(2010) was clearly not available to Kepler and con¬ 
tains whatever it contains regardless of what expec¬ 
tations one may get from reading Kepler. Could it 
also be that the seemingly arbitrarily chosen scalar 
SCMSS may be a poor choice if the objective is to 
illustrate the 60 year periodicity of the planetary 
system, and that for instance it is easier to find it 
if the 3D vector information is used as well? 


In this paper I will take it as a premise that com¬ 
parisons of spectral peaks cannot be anything but 
a qualitative indicator of coherence and therefore 
argue only based on the magnitude squared coher¬ 
ence. I then discuss the effect of detrending on the 
global temperature data. A standard wavelet co¬ 
herence estimation routine which is widely used in 
climate and geophysical data analysis is then ap¬ 
plied to the data. I also demonstrate that it is not 
necessary to introduce a new MVDR MSC estima¬ 
tor to see peaks around 60 years, the periodogram 
based estimator may also show that if the window 
length is increased. In order to validate these peaks 
their statistical significance must be found. I dis¬ 
cuss four ways of doing that and end up with a 
random phase method which has been designed to 
be applicable to serially correlated data as one has 
here. This is a new method compared to that used 


Holm (2014a). I will demonstrate that with com¬ 


mon values of significance levels it is hard to argue 
for the statistical significance of coherence between 
the data sets for both the 20 year and the 60 year 
periodicities. 
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Figure 3: Detrended global temperature anomaly. 
Upper: linear detrending as used in this study, 
lower: parabolic detrending 


2 Estimation of Magnitude 
Squared Coherence 

Two independent estimation methods, wavelet co¬ 
herence and periodogram based coherence, are used 
here. An alternative which is not discussed is 
multitaper MSC estimation. It has been used 
for instance to demonstrate coherence between the 


global temperature and C02 (Kuo et al. 1990). 


Another possibility is the MVDR estimator for 
MSC of Benesty et al. (20061 used by Scafetta 


(2014). These results are straight-forward to repro¬ 


duce and the method was used by Scafetta because 
it was believed that the periodogram based MSC 
estimator could not resolve the interesting line at 
a 60 year period. As will be shown here this is not 
the case, so therefore the relatively uncharacterized 
MVDR estimator which also depends on the set¬ 
ting of a regularization parameter is not used here. 
In general statistical properties are much harder 
to find for MVDR-based estimators. This is the 
case also in other fields where we have experience 
with MVDR or it equivalents, the Capon method 


and adaptive spectral estimation (Synnevag et al. 


2007). 


2.1 Detrending 

The MSC results are sensitive to the kind of de¬ 
trending applied to the global temperature data. 
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Alternatives are to subtract the mean, to subtract be found I 

Carter 


L987 1 

/Vang et al. 

2004 

). The 

a linear trend, or to subtract a parabola. These method was used in 

Holm 

(2014a 

). Since in partic- 


curves fitted to the raw data are shown in Fig. [2] 
Subtraction of the mean is obvious to do in any 
spectral method in order to prevent strong compo¬ 
nents at 0 Hz to leak into the low frequency parts. 
The most critical period in question is at around 
60 years. This is a frequency of only 1/60 year^ 1 
which is very close to 0 and in the frequency range 
which is affected most by detrending. 

The safest way to justify linear and parabolic de¬ 
trending is to argue in terms of a physical model 
for the data. Lacking that, another criterion could 
be to test for sensitivity. If a result is critically 
dependent on one particular detrending, then the 
result would be dubious. The temperature data 
detrended by a linear fit and a parabolic fit were 
therefore plotted in Fig. [3| Compared to the raw 


ular the SCMSS data is highly correlated due to its 
periodic nature, this will tend to indicate coherence 
where in reality there is none. 

No analytical expression for the statistics exists 
in case of correlated data so one has to resort to 
Monte Carlo simulation to find a test for the null 
hypothesis. In order to generate a data set to test 
against, there are several possibilities. One way is 
to fit an AR(1) model to each data set and gener¬ 
ate simulated data sets from this red noise model 


(Grinsted et ah, 2004). This method will also indi¬ 


cate coherence when there is none when there are 
serially correlated and in particular periodic signals 


present (Poluianov and Usoskin 2014). 


The non-parametric random phase method of 


Ebisuzaki (1997) is better as it was developed for 


serially correlated data. Several studies have used 


particular the lower series, detrended by a parabolic 

this method 

Moron et al. 

2008 

Traversi et al. 

function, shows a tendency to amplify a low fre- 

2012 

Poluianov and Usoskin 

2014 

). It consists of 


quency oscillation in particular in the range 1850- 
1890 but also after 2000. 

These differences can be seen in the MSC es¬ 
timates also. Parabolic detrending enhances low 
frequency periodicites more than the other meth¬ 
ods whether the periodogram based method or the 
MVDR method is used for MSC estimation. This 
will be commented on later also. In my view, 
parabolic detrending adds uncertainty to the result. 
Are we seeing phenomena that are in the data or 
are we seeing the result of an interaction between 
the detrending and the data? 

A middle way, using subtraction of the mean for 
the SCMSS data and linear detrending for the tem¬ 
perature data has therefore been chosen here. The 
latter is also justified by the argument in |Ebisuzakij 


Fourier transforming a data set, randomizing the 
Fourier phases while maintaining the complex con¬ 
jugate symmetry of the frequency domain data so 
that the inverse transform is still real, and doing 
an inverse transform to get back to the time do¬ 
main. Ebisuzaki (1997) recommends that the pair¬ 


wise coherence between a large set (1000 in our 
case) random phase versions of time series 1 and 
the original time series 2 is found, and then that 
the results are sorted and the 95% percentile is plot¬ 
ted along with the coherence of the two datasets. 
That paper also showed that as an AR(2)-process 
became more and more periodic, this method tends 
to assign coherence where there is none. Therefore 
we have adopted the method of the Appendix of 


Traversi et al. (2012) where for each iteration also 


(1997) that linear detrending removes the influence 


of potential unresolved low frequencies. The careful 
analysis of linear detrending on temperature data 


Kuo et al. (1990) also justifies it. 


the coherence between a random phase version of 
series 2 and the original series 1 is found and the 
maximum of the two coherence functions is found 
for each step. The use of an additional phase crite¬ 
rion, as proposed there, is not used here. 


2.2 Statistical significance 

An analytical expression for the statistics of the 
MSC estimator exists for the case of an integer 
number of non-overlapping segments and Gaussian 
uncorrelated (white) data. To test the null hypoth¬ 
esis of no coherence, the true value can be set to 
zero. In that case an independence threshold can 


The Monte Carlo simulation in Scafetta (2014) is 


yet another way to attack this problem. However, 
since it is based on comparing the SCMSS with sim¬ 
ulated sinusoids in additive independent noise at 
the expected frequencies, it is based on something 
different from a null hypothesis. It starts with the 
assumption of coherence and tests its significance 
rather than starting with the hypothesis that there 
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Figure 4: Wavelet spectra of linearly detrended 
HadCRUT3 (top) and zero-mean SCMSS (bottom) 


is no coherence. This goes against the principle of 
prudence in my opinion. It also depends on the 
value of a signal to noise ratio parameter. The 
setting of this parameter is not discussed at all in 


Scafetta (2014) and an arbitrary value is just used 
without justification. Finally, this additive noise 
approach, which some might say is rather naive, is 
not even considered in papers that discuss how to 
find confidence intervals for serially correlated data 


(Ebisuzaki 1997). 


2.3 Wavelet coherence analysis 

The wavelet estimation software of lGrinsted et all 


(2004) has been used for many different climate re¬ 
lated and astrophysical applications. It estimates 
spectra and magnitude squared coherence using the 
Morlet wavelet. In order to show where the time- 
frequency space is free of edge artifacts, the data is 
only plotted inside the cone of influence, highlight¬ 
ing the lack of data for estimating components at 
high periods. 

This method is applied to the detrended data 
of Fig. [2l It can be seen that the wavelet spectra 
of Fig. m are in general agreement with the spec¬ 
tral analysis of these data sets in Holm (2014a b). 
A line around 60 years is very visible in the Had- 
CRUT3 spectrum, but barely visible in the SCMSS 
spectrum. In addition the SCMSS spectrum has a 
strong component around 20 years which is also 
visible in the HadCRUT3 spectrum. 

The wavelet squared coherence function is shown 


Figure 5: Wavelet magnitude squared coherence es¬ 
timate for linearly detrended HadCRUT3 vs zero- 
mean SCMSS. The contour shows the 95% sig¬ 
nificance level based on the random phase model 


of Traversi et al. (2012); Poluianov and Usoskin 


(2014). The arrows show relative phase relation¬ 
ships with an arrow pointing to the right indicating 
in-phase variation 


in Fig. [5} It is based on averaging of two indepen¬ 
dent values in the time dimension and in the scale 
dimension, so the number of independent averages 


is Nd = 4 (see Appendix of Torrence and Webster 


(1999)). The confidence interval based on the ran¬ 
dom phase method from the Appendix of Traversi| 
et al. (2012) and Poluianov and Usoskin (20141) is 


used. There are only a few areas with significant co¬ 
herence, like period ~ 8 years around 1950. There 
is quite large coherence in the 17-22 and the 50- 
60 year ranges, but it is not large enough to reach 
above the 95% significance level. This is different 
from the result of Holm (2014a) where coherence in 
the range 15-17 years was found to be just above 
the significance threshold. The lack of significant 
coherence in Fig. [5] is due to the use of a signifi¬ 
cance interval estimate which is better suited for 
the serially correlated data. 


2.4 Periodogram based coherence 
estimate 

In the periodogram based method for estimating 
magnitude squared coherence, the detrended time 
series is divided into short equal-length windowed 
segments. Then power and cross spectra are found 
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Figure 6: Periodogram magnitude squared coher¬ 
ence of linearly detrended HadCRUT3 and zero- 
mean SCMSS found by averaging 13 overlapping 
segments of length 41 years, N ( [ ss 6.4 independent 
averages under a white noise assumption. The 95% 
significance threshold is shown as a dash-dotted line 



Figure 7: Periodogram magnitude squared coher¬ 
ence of linearly detrended HadCRUT3 and zero- 
mean SCMSS found by averaging 6 overlapping seg¬ 
ment of length 73 years, Nd ss 3.4 independent av¬ 
erages under a white noise assumption. The 95% 
significance threshold is shown as a dash-dotted line 


by averaging over all the segments. The MSC esti¬ 
mate is found as the ratio of the squared cross spec¬ 
trum estimate and the power spectra. The number 
of averages is given by the number of windowed 
segments. In order to utilize the data well, con¬ 
siderable overlap between segments is used. In our 
case we use a Kaiser window with /3 = 6 and the 
overlap is OL = 75%. That means that the num¬ 
ber of independent averages, Nd, which determines 
confidence intervals, is considerably lower than the 
number of averages. See the Appendix for details 
of the algorithm and its properties. 


the 95% confidence level, but is below it. One 
would have expected the solid MSC curve to reach 
well above the dash-dotted 95% significance level 
if there was significant coherence.There is of course 


no disagreement between Scafetta (2014) and I that 


In Holm (2014a I the MSC was plotted for win¬ 


dow lengths 20 and 30 years, and results were given 
in table format for length 40 years also. Here I 
don’t preselect the window length but rather find 
it by increasing it until the required number of seg¬ 
ments fills all the available data as much as possible 
giving window lengths like 41, 73, and 109 years. 
Despite this, some of the data will have to be dis¬ 
carded, but this is done from the beginning rather 
than from the end of the dataset. The reasoning 
is that the data from 1850 is both less reliable and 
also less interesting than recent data. See Fig. [9] in 
the Appendix for an illustration of this. 

The magnitude squared coherence using a win¬ 
dow length of 41 years is shown in Fig. [6] The 
coherence in the 15-17 year range almost touches 


windows of length 30-40 years are too short for 
finding coherence between time series on the 60 
year scale. But as segment lengths are increased 
from 41 to 109 years, the number of independent 
averages, N d , goes from 6.4 to 1.9. Due to the cor¬ 
related nature of the data, this is the upper limit of 
Nd, and the actual number of independent averages 
is even lower. This number is so low that signifi¬ 
cance levels based on the analytical model of |HoIin| 
(2014a]) were not considered to be reliable. There¬ 


fore the window length was not increased beyond 
that in the first study. 

Now that significance levels are found by Monte 
Carlo simulation, larger segment lengths like 73 
and 109 years can give meaningful results. This 
is shown in Figs. [7] and [8] It is seen that a peak 
in the 50-60 year range starts to appear in addi¬ 
tion to the one around 19 years. These results are 
similar to Fig. 10 with 110 years segment length 


of Scafetta (2014) with respect to the peak around 


60 years, but not identical to it. The coherence es¬ 
timate here is below the 95% significance level, in 
particular in the 50-60 year range. 
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Figure 8: Periodogram magnitude squared coher¬ 
ence of linearly detrended HadCRUT3 and zero- 
mean SCMSS found by averaging 3 overlapping seg¬ 
ments of length 109 years, Nd w 1.9 independent 
averages under a white noise assumption. The 95% 
significance threshold is shown as a dash-dotted line 


case for the two independent methods, the wavelet 
coherence estimator of Fig. [sland the periodogram 
based estimator of Figs. [6]-18] At first glance this 
seems to be in agreement with Scafetta (2014). 


However, an estimate of coherence of high value 
does not mean that there is a coherence of high 
significance. Therefore a central point of this paper 
has been a discussion of significance levels. The 
challenge is that the data here is periodic. The 
four methods for assessing significance were: 


1. Null hypothesis testing via Monte Carlo sim¬ 
ulation based on the non-parametric ran¬ 
dom phase method for serially correlated data 


Ebisuzaki 

19971 

Traversi et al. 

2012 


2. Null hypothesis testing based on an analytical 
expression for independence level derived for 


white Gaussian data (Holm 2014a) 


The MSC peak at about 60 years in Fig. [8] is 
above 0.9 while Scafetta’s was below 0.8. The 
parabolic detrending of his Fig. 10 has actually en¬ 
hanced the peak, and a change to linear detrend¬ 
ing, like we use, will diminish Scafetta’s MVDR 
peak even further to around 0.7. Despite the higher 


3. Null hypothesis testing via Monte Carlo sim- 


value of our peak compared to Scafetta (2014) it 


ulation based on a red noise model (Grinsted 


et al. 2004) 


4. 


Testing via Monte Carlo simulation based on a 
simple model with sinusoids in additive noise 
with an unknown signal to noise that needs to 


turns out not to reach above the dash-dotted 95% 
significance level. 

The period of the main peak in Figs. [6]-[8]is 17.5, 
19, and 18.4 respectively. This is slightly less than 
20 years, the peak value of the SCMSS spectrum. 
However, due to the ratio in the definition, Eq. [l] 
the actual peak also depends on the cross spectrum 
as well as the resolution of the spectral estimates. 
Also it is evident from the upper panel of Fig. [4] 
that the peak position in the 20 year range for the 
temperature data varies with time, and this varia¬ 
tion is smeared out in the MSC estimate. 


3 Discussion and conclusion 

It is not hard to find high peaks in the magnitude 
squared coherence estimate in the 15-20 year range 
as well as in the 50-60 year range when the speed 
of center of mass of the solar system is compared 
to the global temperature anomaly. This is the 


be set (Scafetta 2014) 


These methods will give widely different results 
with respect to significance. Prudent use of meth¬ 
ods implies that conclusions only can be drawn 
from the first model as it is the only one that fits 
the data, as has been argued here. That means 
that coherence around 20 year and 60 year peri¬ 
ods cannot be considered to be trustworthy. This 


strengthens the conclusion of Holm (2014a*). 


The principle of prudence in addition to the lack 
of an accepted physical mechanism, then dictates 
the conclusion that one cannot say that there is 
any coupling between the movement of the cen¬ 
ter of the solar system and the global temperature 
data. It is more credible to look for down to earth 
explanations in e.g. oscillations in the atmospheric 
general circulation model, the Pacific decadal os¬ 
cillation, or the Atlantic multi-decadal oscillation 


due to the meridional overturning circulation (Gao 


et al. 

2014 

Sun et al. 

2015 

Yao et al. 

2015 
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A Estimation of magnitude 
squared coherence 


The magnitude squared coherence (MSC) is defined 
as a normalized cross spectrum: 


C X y(f) 


\P*y(f )\ 2 

Pxx(f) ■ Pyy (/) 


(1) 


where P xx (f) and P vy (f) are the power spectra of 
the two time series and P xy (/) is the complex cross 
spectrum. C xy {f) will always be between 0 and 
1. For the estimate both time series are divided 
into N segments which are Fourier transformed and 
averaged: 


Cxy(f) = 


IEl~oA,(/)TO)l 2 


En=0 \ X n(fW En=0 \ Y n(f)\ 


N—l 


( 2 ) 


A case that demonstrates the need for averaging 
is to let N = 1. Then the coherence estimator of 
Eq. [2] will always output unity no matter what the 
input signals are and therefore be severly biased 
upwards. Such upward bias is typical when the 
number of averages is low. 

For averaging, each segment in time is weighted 
before Fourier transformation. Each segment over¬ 
laps with its neighbors as depicted in Fig. [9] for 
a Kaiser window with /3 = 6. Since the window 
tapers the ends heavily down, OL = 75% overlap 
is used between the N segments. The number of 
segments is: 


N = 1 + 


I N t0 t N Se g . 

l N seg (1^0L) } 


(3) 


where N to t is the total number of samples and N seg 
is the segment length. 

The actual value for the number of independent 
averages, Nd, will be less than N when there is 
overlap, but statistics for the MSc estimator is only 
known for the case of an integer number Nd > 2 



Figure 9: N = 6 shifted and weighted windows 
with 75% overlap, segment length N seg = 73 years 


of non-overlapping segments (|Carter| 1987 Wang 


et al. 2004). However, one can assume that the 


reduction in the number of independent averages is 
similar to that which happens when estimating a 


spectrum (Holm 2014a). In the case of an overlap 


of OL = 75%, the number of independent aver¬ 
ages, Nd, is estimated from the correlation of the 
data window at the p = 25, 50, and 75% points, 


c(p), Harris (1978), assuming that the input data 
is white: 


7V- 1 = ^[1 + 2c 2 (0.75) + 2c 2 (0.5)] 


N 2 


[c 2 (0.75) + 2c 2 (0.5) + 3c 2 (0.25)] 


(4) 


The ratio Nd/N in the case of a Kaiser(6) window 
and 75% overlap is asymptotically 0.69, starting 
at 0.53 for very low values of N. The difference 
between these values is due to the last three terms 
which can be neglected for N > 10. However, due 
to the assumption on the data, this estimate will 
overestimate the number of independent averages 
in the case of non-white correlated data, and in 
particular when it is periodic. 

A typical advice regarding the required number 
of independent averages is found in | Carter et al.| 
(1973) where it is said that the statistical prop¬ 
erties ”... dramatically portray(s) the requirement 
that Nd be large. ” What this means in practice is 
that Nd should preferrably be larger than 4-6 as 


Holm (2014a) where the lowest value used was 


Nd = 7.1. However, with a Monte Carlo based 
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method for finding significance level, rather than 

an analytic one, lower values of N r j can give mean¬ 
ingful results as demonstrated in this paper. 
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